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Abstract 

Using a recursive approach, we obtain a simple exact expression for the L 2 -distance from 
the limit in Regnier's [5] classical limit theorem for the number of key comparisons required 
by Quicksort. A previous study by Fill and Janson p] using a similar approach found 
that the (fe-distance is of order between n _1 logn and n~ 1//2 , and another by Neininger 
and Ruschendorf [4] found that the Zolotarev (^-distance is of exact order n _1 logn. Our 
expression reveals that the L 2 -distance is asymptotically equivalent to (2n _1 lnn) 1//2 . 



1. Introduction, review of related literature, and summary 

We consider Hoare's [3] Quicksort sorting algorithm applied to an infinite 
stream of iid (independent and identically distributed) uniform random variables 
Ui, U2, ■ ■ ■ ■ Quicksort chooses the first key U\ as the "pivot" , compares each of the 
other keys to it, and then proceeds recursively to sort both the keys smaller than 
the pivot and those larger than it. If, for example, the initial round of comparisons 
finds U2 < Ui, then U2 is used as the pivot in the recursive call to the algorithm 
that sorts the keys smaller than U\ because it is the first element in the sequence 
Ui, U2, ■ ■ ■ which is smaller than U\. In a natural and obvious way, a realization 
(requiring infinite time) of the algorithm produces an infinite rooted binary search 
tree which with probability one has the completeness property that each node has 
two child-nodes. 

Essentially the same algorithm can of course be applied to the truncated sequence 
Ui, U2, ■ ■ ■ , U n for any finite n, where the recursion ends by declaring that a list of 
size or 1 is already sorted. Let K n denote the number of key comparisons required 
by Quicksort to sort U\, U2, • • ■ , U n . Then, with the way we have set things up, 
all the random variables K n are defined on a common probability space, and K n is 
nondecreasing in n. Indeed, K„ — K n -\ is simply the cost of inserting U n into the 
usual (finite) binary search tree formed from U\, . . . , U n -\. 

In this framework, Regnier [5] used martingale techniques to establish the fol- 
lowing L p -limit theorem; she also proved almost sure convergence. We let 

(M n := EA'„. 
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Theorem 1.1 (Regnier [5]). There exists a random variable T satisfying 

y . = K " - Mrt L" j, 

n + 1 

for every finite p. 

Rosier [B] characterized the distribution of Rcgnicr's limiting T as the unique 
fixed point of a certain distributional transformation, but he also described explic- 
itly how to construct a random variable having the same distribution as T. We 
will describe his explicit construction in equivalent terms, but first we need two 
paragraphs of notation. 

The nodes of the complete infinite binary search tree are labeled in the natural 
binary way: the root gets an empty label written e here, the left (respectively, right) 
child is labeled (resp., 1), the left child of node is labeled 00, etc. We write 
:= Uo<fc<oo{0, l} fe for the set of all such labels. If Vg denotes the key inserted at 
node 9 £ O, let Lg (resp., Rg) denote the largest key smaller than Vg (resp., smallest 
key larger than Vg) inserted at any ancestor of with the exceptions Lg := and 
Rg := 1 if the specified ancestor keys do not exist. Further, for each node 9, define 

(f)g := Rg - Lg , U$ := 4>eo/ '<j>e, 

(1.1) Gg := 4>gC(Ug) — 4>g - 2(j)g In 4>g + 2(j)g In 060 + 2(j)gi ln^ei, 

where for < x < 1 we define 

(1.2) C{x) := l + 2xlnx + 2(l-a5)ln(l-a;). 

Let 1 < p < oo. The d p -metric is the metric on the space of all probability 
distributions with finite pth absolute moment defined by 

d p (F u F 2 ) := inf ||X 1 -X 2 || P , 

where we take the infimum of L p -distances over all pairs of random variables Xi and 
X2 (defined on the same probability space) with respective marginal distributions 
Fi and F 2 . By the dp-distance between two random variables we mean the dp- 
distance between their distributions. 

We are now prepared to state Rosler's main result. Note: Here and later results 
have been adjusted slightly as necessary to utilize the same denominator n+1 
(rather than n) that Regnier used. 

Theorem 1.2 (Rosier [5]). For any finite p, the infinite series Y — ^2jLo Yl\e\=j *~*8 
converges in L p , and the sequence Y n = (K n — /z n )/(n+l) converges in the d p -metric 
to Y. 

Of course it follows from Theorems I1.1H1.2I that T and Y have the same distri- 
bution. The purpose of the present extended abstract is to show that in 
fact T = Y and to provide a simple explicit expression for the L 2 -distance 
between Y n and Y valid for every n; this is done in Theorem 11.41 below. 

We are aware of only two previous studies of the rate of convergence of Y n 
to Y, and both of those concern certain distances between distributions rather 
than between random variables. The first study, by Fill and Janson pQ, provides 
upper and lower bounds on d p (Y n , Y) for general p; we choose to focus here on d 2 . 

Theorem 1.3 (Fill and Janson [1]). There is a constant c > such that for any 
n > 1 we have 

en -1 Inn < d 2 {Y ni Y) < 2n~ 1/2 . 
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To our knowledge, the gap between the rates (log n)/n and n^ 1 ' 2 has not been nar- 
rowed. Neininger and Ruschendorff [4] used the Zolotarev (^-metric and found that 
the correct rate in that metric is logn, but their techniques are not sufficiently 
sharp to obtain (^(Yn, Y) ~ cn _1 Inn for some constant c. 

In our main Theorem 11.41 proved using the same recursive approach as in 
Fill and Janson [1] , we find not only the lead-order asymptotics for the L 2 -distancc 
Iji^n — Y ||2, but in fact an exact expression for general n. It is interesting to note that 
the rate n _1 / 2 (log n) 1 / 2 for L 2 -convergence is larger even than the upper-bound rate 
of nT 1 / 2 for (^-convergence from Theorem 1 1.31 

Theorem 1.4 (main theorem). For n > we have 

\\Y n -Y\\l = {n + l)- 1 (2H n + l + -^—) -4 V fc~ 2 = 2— + O (- 
\ n + 1 / / — 1 n \n 

v 7 k=n+l V 

where H n := X}j=ii * s the n ^ n harmonic number and the asymptotic expression 
holds as n — » 00. 

The remainder of this extended abstract is devoted to a proof of Theorem 11.41 
which is completed in Section [5l 

2. Preliminaries 

In this section we provide recursive representations of Y n (for general n) and Y 
that will be useful in proving Theorem 11.41 Our first proposition concerns the 
limit Y and gives a samplc-pointwise extension of the very well known [5] distribu- 
tional identity satisfied by Y. Recall the notation (jl.ll) and the definition of Y in 
Theorem 11.21 as the infinite series J^JLo ^2\e\=j ^6 in L 2 . 

Proposition 2.1. There exist random variables Fg and Hg for 9 € such that 

(i) the joint distributions of (Gg : 9 £ 0) ; of (Fg : 9 £ 0), and of (Hg : 9 6 0) 
agree; 

(ii) (Fg : 9 £ O) and (Hg : 9 £ 0) are independent; 

(iii) i/ie series 

00 00 

(2.1) ^ (0) :=EE^ and r(1): =EE^ 

j=o |0|=j 3=0 \e\=j 

converge in L 2 ; 

(iv) the random variables Y^ and Y^ are independent, each with the same 
distribution as Y , and 

(2.2) Y = C(U) + C/r (0) + UY {1) . 

Here U := Ui, with U := 1 — Ui, and C is defined at (|1.2j) . 
Proof. Recall from (|1.1|) that 

Gg = (f>g — 2(/)g \n.(/)g + 2(f)g \n(j>g + 2<j)g\ \n<f>$\. 
For £ 0, define the random variable tpg (respectively, ?pg) by 
fe ■= 4>oe/U (resp., ipg := 4>w/U). 



4 



PATRICK BINDJEME JAMES ALLEN FILL 



Then U and ipg are independent (resp., U and ipg are independent), ipg and ipg each 
have the same distribution as <pe, and 

G Q g = UF g and G w = UH e , 

where 

Fg := Ifig - 2ipg\\\ipg + 2ip eo In lfg Q + 2lfgilriipgi, 
Hg := ipg - 2lpg In ipg + 2ip eo In lpg + 2lp ei In Ipgi . 

The proposition follows easily from the clear equality 

C{F g : G 6) = C{Gg :8eQ) = C(Hg : 9 £ O), 

of joint laws and the fact that 

oo oo oo 

3=0 \g\= 3 3=0 \g\ =j 3=0 \e\=3 

OO OO 

= C(U) + Uj2Y, F e + uT,J2 H <> 

j=o \g\ =j 3=0 \e\= 3 

= C(U) + UY (0) + UY (1) . 

□ 

We next proceed to provide an analogue [namely, (|2.4[) ] of (|2.2[) for each Y„, 
rather than Y, but first we need a little more notation. 

Given < x < y < 1, let {U* y ) n >i be the subsequence of {U n ) n >i that falls 
in (x,y). The random variable K n (x,y) is defined to be the (random) number of 
key comparisons used to sort Ui V , . . . , U* v using Quicksort. The distribution of 
K n (x,y) of course does not depend on (x,y). 

We now define the random variable 

(2-3) Y nfi := [K ue{n) {Lg,R g ) - fc ( ( n )]/[i/ 9 (n) + 1], 

with the centering here motivated by the fact that [i Vf) {n) is the conditional expec- 
tation of K ve ( n }(Lg, Rg) given (vg(n),Lg,Rg). Then for n > 1 we have 

(2.4) Y n = -JL- C nM n) + 1) + ^4±i y ni0 + F Bll> 
n + 1 n + 1 n + 1 

where, as in [2], for 1 < i < n we define 

C n (i) := ^(n - 1 + /Ui-l + /i„-i - Hn). 

We note for future reference that the classical divide-and-conquer recurrence for [i n 
asserts precisely that 



(2.5) ^C„(i)=0 

i=l 

for n > 1. 
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It follows from (12.21) and (12.41) that for n > 1 we have 



Y n -Y 



n + 1 



vi{n) + 1 

71+1 



Y n .j - UY 



(1) 



+ 



(2.6) 



71+ 1 
1^1 + W 2 + W 3 



C n {v {n) + 1) - C{U) 



Conditionally given U and vo(n), the random variables W\ and W2 are independent, 
each with vanishing mean, and W3 is constant. Hence 

E[(F„ - Y) 2 I [/, i*(n)] = E[W^ 2 | [/, i*(n)] + E[W 2 2 | E7, i*(n)] + W 2 

and thus, taking expectations and using symmetry, for n > 1 we have 

(2.7) a 2 := E(F„ - y) 2 = E Wf + E W| + E W 3 2 = 2E Wj 2 + E W 3 2 . 
Note that 

(2.8) a 2 = EF 2 = cr 2 := 7- |tt 2 
(for example, [2]). 



3. Analysis of E W 2 

In this section we analyze ~EW 2 , producing the following result. Recall the 
definition of a 2 at 



Proposition 3.1. Let n> 1. For W\ defined as at (|2 .6[) . we have 

^ n-l 9 



E^ 2 = , ^(fc + l) 2 « 2 



n(n+l) 2 ^ v ' 6(71 + 1) 

For that, we first prove the following two lemmas. 
Lemma 3.2. For any n > 1, we have 

2 



E 



Lemma 3.3. For (my n > 1, we have 



E 



u (n) + 1 



6(n+l) 



Proof of Lemma \3.2X There is a probabilistic copy Y* = (Y*) of the stochastic 
process (Y n ) such that 

Y n ,o = Y Uo ( n -) 

and Y* and Y^ are independent of ([/, fo(n)). This implies 



E 



E 



M^)±J_\ (y* _ F (0) N2 

n + 1 I "°< n) 



(i 
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By conditioning on vo(n), which is uniformly distributed on {0, 



E 



v {n) + 1 
n+1 



(Y n ,o 



y(0) 



E 



u Q (n) + 1 
n + 1 



.,n 



V (n) 



1}, we get 



= -J2 



k=0 



k + 1 
n+1 



□ 



Proof of Lemma 13.31 Conditionally given i/o (n) and Y^°\ we have that U is 
distributed as the order statistic of rank vo(n) + 1 from a sample of size n from 
the uniform(0, 1) distribution, namely Beta(i/o(n) + l,n— Uo(n)), with expectation 
[vo(n) + 1} / (n + 1) and variance [(^(n) + l)(n — vo(n))]/[(n + l) 2 (n + 2)]. So, using 
also the independence of uq(ti) and Y^°\ we find 



E 



Vo{n) + 1 
n+1 



- U 



y(o) 



E 



= cr 2 E 



{v {n) + l)(n - u (n)) / (0) 
(n+l) 2 (n + 2) I 

x - + l)(n - ft) = 



(z/ (n) + - v (n)) 



(n + l) 2 (n + 2) 
- 2 1 



(n + l) 2 (n + 2) n 



n(n+l) 2 (n + 2) 6 



x -n(n + l)(n + 2) 



6(n + l) 



□ 



Proof of Proposition \3.1\ We have 
E Wi = E 



E 



E 



n + 1 

^o(rc) + 1 
n + 1 

v [n) + 1 
n + 1 



Y n n — Y 



(0) 



+ E 



""W + 1 (V„ „ - y"A ( "° ( " ) + 1 - tr^ y'»» 

n + 1 V / \ n + 1 / 



2E 



The result follows from Lemmas I3.2H3.3[ and the fact that, conditionally given 
(vo(n), Y n fi, Y^). the random variable U is distributed Bcta(i/o(n) + 1, n — ^o(n)), 
so that the last expectation in the preceding equation vanishes. □ 



4. Analysis of E W% 
In this section we analyze E Wf, producing the following result. 
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Proposition 4.1. For any n > 1 we have 

where H n = Y^j=i i _1 * s ^ e n th harmonic number and := 5Zj=ii _2 * s ^ e 

nth harmonic number of the second order. 

For that, we first prove the following two lemmas. 

Lemma 4.2. For any 1 < k < n we have 

D(n, k) := mn l _ k + l) - tT-H^t) dt = H k -i - H n , 

where B is the beta function. 
Lemma 4.3. For any n > 1 we have 

E[C n (u (n) + l)C{U)] = -^—E[C n {v {n) + I)] 2 - 

n + 1 

Proof of Lemma The result can be proved for each fixed n > 1 by back- 
wards induction on k and integration by parts, but we give a simpler proof. Recall 
the defining expression 

B(a,[3) = f ^{l-tf^dt 
Jo 

for the beta function when a, /? > 0. Differentiating with respect to a gives 

f t a - l {l-t) l5 - 1 {\nt)dt = B{a 1 p)[ij{a)~iP{a + p% 
Jo 

where "0 is the classical digamma function, i.e., the logarithmic derivative of the 
gamma function. But it is well known that = -ffj-i for positive integers j, so 
the lemma follows by setting a = k and ft = n — k + I. □ 

Proof of Lemma \4-3\ We know that Vq(u) + 1 ~ unif{l, 2, . . . , n} and that, con- 
ditionally given z^o(n), the random variable U has the Beta(z^o(n) + l,n — vq(ji)) 
distribution. So from Lemma \A. 21 repeated use of (|2.5[) , and the very well known 
and easily derived explicit expression 

Hn = 2(n + l)H n - An, n > 0, 
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wc haw 

E[C„Oo(n) + l)C(?7)] 

i n i r 1 

n . _ . j 

77 7?/ I _L 77 | _L 

l " 

= , +1) E Cn(j)[2jfli + 2(n - j + l)tf„- 3 -+i] 
1 " 

= , +1) E Cn(j)[2j^-i - 4(j - 1) + 2(n - j + l)^ - 4(n - j)] 

j = l 

^ 71 1 72 

= n(n + 1) E ^O'MW-l + Mn-j] = , ~ x) E C ™0')[W-1 + Mn-j - ^ 
3=1 3=1 

ra 1 



1 x nJ2 C ^ = —^nCn(Mn) + I)] 2 , 



3=1 



as desired. 



□ 



Proof of Proposition \4-l\ It follows from Lemma 14.31 that 

1 2 



K =E 



71+ 1 



71 + 1 

EC([/) 2 



C n (i^(n) + l)-C(C0 



E[C, i (^(n) + l)] 2 -2 



n+ 1 



E[C„(n,(n) + + E C{Uf 



E[C„W«)H)f 



n + 1, 

Knowing that EC(C/) 2 = <7 2 /3, and from the proof of Lemma A. 5 in [5] that 

E [C n (Mn) + I)] 2 = f (1 + if - § (1 + I ) (1 + i) ~ ln-3H n , 
wc have 



- 7 4/n + 2 



3 V n + 1 



i?l 2) 



3n(n+ 1) : 



as claimed. 



□ 



5. A CLOSED FORM FOR ol 



In this final section we complete the proof of Theorem 11.41 for which we need 
one more lemma. 

Lemma 5.1. For Hn = Sj=i 3 2 > ^ e harmonic number of the second order, 
we have 

n 
3=1 

/or any nonnegative integer n. 
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The lemma is well known and easily proved. 

Proof of main Theorem \l-4\ For n > 1 we have from the decomposition (|2.7[) 
and Propositions 13.11 and 14.11 that 

9 n—l 
2 Z Wl, i 1 ^2„2 



7 4 



#(2) + I iJ 



3 3 \n + lj " 3n(n+ l) 2 
and we recall from (|2.8j) that a 2 = er 2 . Setting x„ := (n + l) 2 a 2 , we have xq = a 2 



and 



with 



2^ 

- } x k + c„ for n > 1, 

fc=0 



c« := y (n + 2)(n + 1) - ? -(n + l) 2 + ^(n + 2)(n + + ^ff n . 

This is a standard divide-and-conquer recurrence relation for i n , with solution 

^ fccfc - (k - l)cfe_i 



= (n + 1) 



(7 



*(* + !) 



n > 0. 



a 



2 = (n + l)" 1 ( 2H n + 1 + — ^— ) + a 2 - 7 + 4i^ 2 > 



After straightforward computation involving the identity in Lemma 15.11 one finds 

6 ' 
n + I, 

as claimed. □ 
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